National Repository of Grey Literature 8 records found  Search took 0.00 seconds. 
Audio-to-Score Alignment tool
Búliková, Tereza ; Sládok, Ondřej (referee) ; Kiska, Tomáš (advisor)
This thesis deals with obtaining spectra and chroma features from audio records. Features are used in synchronization algorithm Dynamic Time Warping. This algorithm is used to create synchronization programs Audio-to-audio and Audio-to-score alignment.
Vamp Plugin for Sonic Visualiser
Pilát, Peter ; Zvončák, Vojtěch (referee) ; Kiska, Tomáš (advisor)
In my Bachelor Thesis, I devote myself to obtaining information from music, the way it can be obtained, the aspect of musical information, and the use of the methods themselves. Then I analyze content-oriented music management methods and also include parameterisation of music recordings and audio signal overall. After familiarizing with the specific parameterization tools to implement the Vamp plugin, which are Sonic Visualiser and Sonic Anotator, I characterize the Vamp Plugin and explain in detail its composition. As explained in the manuals and the calculations in progress, the RMS calculation function of the given signal with the possibility of segmentation functions as well as the function of displaying the sound rate or possible changes in the track temperature. Last but not least, we mention the possible use of these supplements in the future and in different sectors.
A tool for simultaneous playback of multiple composition interpretations
Švejcar, Michael ; Ištvánek, Matěj (referee) ; Miklánek, Štěpán (advisor)
The purpose of this Bachelor’s thesis was to create a piece of software which enables the user to simultaneously play back multiple interpretations of a musical piece and switch between them instantaneously. This was achieved using the App Designer in the MATLAB environment, which is intended for developing applications with graphical user interface. The key to the development of the application was especially the use of available toolboxes and algorithms for computing chromagrams and multiscale dynamic time warping. The final IntSwitcher player enables the user to load two recordings of interpretations of one song. Chromagrams which characterize the individual recordings in terms of tonal development over time are first calculated from the input files. After that, the multiscale dynamic time warping method is applied on the chromagrams, which outputs the warping path. The warping path in this case is a matrix, in which musically corresponding samples of loaded audio files are assigned together with the resolution of 50 ms. From this, the corresponding time position of currently inactive track is computed along with its slider position. If the user switches the currently played recording, the second track starts playing in the same part of composition, even if that part is at a different time in each of the individual recordings. The final software is an appropriate tool for studying differences between various interpretations of the same musical piece.
System for finding duplicate recordings based on audio information
Švejcar, Michael ; Miklánek, Štěpán (referee) ; Ištvánek, Matěj (advisor)
This diploma thesis discusses different methods of detecting duplicates in a music file database. The problem at hand is that files containing the same recording may differ in sound quality, applause at the end of a performance and other such parameters. The aim of this thesis is to design and implement a system that identifies duplicate recordings and provides an output file for the comparison. The system needs to not be affected by the mentioned parameters but precise enough to prevent matching non-identical recordings. The system is realized using the Python programming language, freely available libraries for computing chroma features, Image Hashing technique and multiple variants of the dynamic time warping algorithm. Three comparison methods were implemented in the system, differing in precision and computation complexity. The methods were then tested on a prepared dataset and four preset precision options were created. The final system seems very precise and insusceptible to detecting recordings that are very similar but not identical as duplicates, for example in case of different interpretations of the same musical piece.
System for finding duplicate recordings based on audio information
Švejcar, Michael ; Miklánek, Štěpán (referee) ; Ištvánek, Matěj (advisor)
This diploma thesis discusses different methods of detecting duplicates in a music file database. The problem at hand is that files containing the same recording may differ in sound quality, applause at the end of a performance and other such parameters. The aim of this thesis is to design and implement a system that identifies duplicate recordings and provides an output file for the comparison. The system needs to not be affected by the mentioned parameters but precise enough to prevent matching non-identical recordings. The system is realized using the Python programming language, freely available libraries for computing chroma features, Image Hashing technique and multiple variants of the dynamic time warping algorithm. Three comparison methods were implemented in the system, differing in precision and computation complexity. The methods were then tested on a prepared dataset and four preset precision options were created. The final system seems very precise and insusceptible to detecting recordings that are very similar but not identical as duplicates, for example in case of different interpretations of the same musical piece.
A tool for simultaneous playback of multiple composition interpretations
Švejcar, Michael ; Ištvánek, Matěj (referee) ; Miklánek, Štěpán (advisor)
The purpose of this Bachelor’s thesis was to create a piece of software which enables the user to simultaneously play back multiple interpretations of a musical piece and switch between them instantaneously. This was achieved using the App Designer in the MATLAB environment, which is intended for developing applications with graphical user interface. The key to the development of the application was especially the use of available toolboxes and algorithms for computing chromagrams and multiscale dynamic time warping. The final IntSwitcher player enables the user to load two recordings of interpretations of one song. Chromagrams which characterize the individual recordings in terms of tonal development over time are first calculated from the input files. After that, the multiscale dynamic time warping method is applied on the chromagrams, which outputs the warping path. The warping path in this case is a matrix, in which musically corresponding samples of loaded audio files are assigned together with the resolution of 50 ms. From this, the corresponding time position of currently inactive track is computed along with its slider position. If the user switches the currently played recording, the second track starts playing in the same part of composition, even if that part is at a different time in each of the individual recordings. The final software is an appropriate tool for studying differences between various interpretations of the same musical piece.
Audio-to-Score Alignment tool
Búliková, Tereza ; Sládok, Ondřej (referee) ; Kiska, Tomáš (advisor)
This thesis deals with obtaining spectra and chroma features from audio records. Features are used in synchronization algorithm Dynamic Time Warping. This algorithm is used to create synchronization programs Audio-to-audio and Audio-to-score alignment.
Vamp Plugin for Sonic Visualiser
Pilát, Peter ; Zvončák, Vojtěch (referee) ; Kiska, Tomáš (advisor)
In my Bachelor Thesis, I devote myself to obtaining information from music, the way it can be obtained, the aspect of musical information, and the use of the methods themselves. Then I analyze content-oriented music management methods and also include parameterisation of music recordings and audio signal overall. After familiarizing with the specific parameterization tools to implement the Vamp plugin, which are Sonic Visualiser and Sonic Anotator, I characterize the Vamp Plugin and explain in detail its composition. As explained in the manuals and the calculations in progress, the RMS calculation function of the given signal with the possibility of segmentation functions as well as the function of displaying the sound rate or possible changes in the track temperature. Last but not least, we mention the possible use of these supplements in the future and in different sectors.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.